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Thermophilic Geobacillus spp. can efficiently hydrolyze hemicellulose polymers and are therefore of interest in biotechnological 
applications. Here we report the genome sequences of two hemicellulolytic strains, Geobacillus sp. CAMR12739 and CAMR5420. 


Received 19 May 2014 Accepted 20 May 2014 Published 5 June 2014 

Citation De Maayer P, Williamson CE, Vennard CT, Danson MJ, Cowan DA. 2014. Draft genome sequences of Geobacillus sp. strains CAMR5420 and CAMR12739. Genome 
Announc. 2(3):e00567-1 4. doi:1 0.1 1 28/genomeA.00567-1 4. 

Copyright © 2014 De Maayer et al.This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported license. 
Address correspondence to Don A. Cowan, don.cowan@up.ac.za. 


Members of the genus Geobacillus are Gram-positive, thermophilic, 
spore-forming, aerobic bacteria (1). They have an extensive capac- 
ity to degrade plant cell wall hemicellulose polymers into their compo- 
nent pentose sugars, a capacity that has been linked to a single chromo- 
somal locus (2, 3). Coupled with this is their production of thermostable 
hydrolytic enzymes, resulting in considerable interest in the use of Geo- 
bacillus in a range of biotechnological applications (4). Geobacillus sp. 
strains CAMR5420 and CAMR12739 were acquired from the CAMR 
(Porton Down, United Kingdom) thermophile culture collection, which 
is now held by the Centre for Extremophile Research, Department of 
Biology and Biochemistry, at the University of Bath, United Kingdom. 
CAMR5420 was added to the collection in 1989; CAMR12739 was col- 
lected in Skaltholt, Iceland, and deposited in the collection in 1987. 

Genome sequencing was performed using the 454 GS-FLX 
(CAMR12739) and Illumina GAIIx (CAMR5420) platforms, 
yielding 2,533,592 reads (average read length of 138 nucleotides 
[nt]) for CAMR53420 and 119,419 reads (average read length of 
365 nt) for CAMR12739. Assembly was performed using a com- 
bination of the CLC Genomics Workbench v6, DNAstar Ngen 
and Velvet vl.l (5) assemblers. Finally, reference assembly was 
undertaken using the complete genomes of related Geobacillus 
spp. using Mauve (6). The genome of CAMR5420 was assembled 
into 96 contigs, yielding a total genome size of 3.50 megabases, 
with a mean G+C content of 51.89%. The genome of 
CAMR12739 was assembled into 74 contigs, with a total genome 
size of 3.41 Mb and a mean G+C content of 52.21%. Further in 
silico assembly was hampered by the presence of highly conserved 
transposase genes. The genomes were annotated using the Rapid 
Annotations using Subsystems Technology (RAST) server (7). 
The genomes of CAMR5420 and CAMR12739 code for 3,816 and 
5,836 proteins, respectively. A single, but variable, locus underly- 
ing the capacity of these strains to hydrolyze and utilize hemicel- 
lulose polymers was identified. The genomes of Geobacillus sp. 
CAMR5420 and CAMR12739 will be employed for comparative 
genomic analyses, with the goal of developing recombinant Geo- 
bacillus strains with optimal hemicellulose degrading capacities. 

Nucleotide sequence accession numbers. These whole-genome 
shotgun projects have been deposited at DDBJ/ENA/GenBank under the 


accession no. JHUSOOOOOOOO (CAMR5420) and JHUROOOOOOOO 
(CAMR12739). The versions described in this paper are the first 
versions, JHUSO 1000000 (CAMR5420) and JHURO 1000000 
(CAMR12739). 
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